Abstract

The best-fit line obtained by minimizing the sum of the squared perpendicular
errors is given.


1 Introduction 

In simple linear regression with data $\{(x_1, y_1), \ldots, (x_n, y_n)\}$, the
"best-fit" line is found by minimizing the sum of the squared vertical errors.
Here we instead minimize the sum of the squared perpendicular errors.


2 Minimizing Squared Perpendicular Errors 


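Consider a right triangle with legs $a$ and $b$, hypotenuse $c$, and $h$ the
distance from the intersection of the legs to the hypotenuse. From similar
triangles $a/h = c/b$, so $h^2 = (ab)^2/(a^2 + b^2)$. For a data point with
vertical error $e_i = y_i - \beta_0 - \beta_1 x_i$, the legs are $a = e_i$ and
$b = e_i/\beta_1$ (the horizontal error), so the squared perpendicular error is
$e_i^2/(1 + \beta_1^2)$. The sum of the squared perpendicular errors $SSE_p$ for
the fitted line $h(x) = \beta_0 + \beta_1 x$ has

$$SSE_p = \sum_{i=1}^{n} \frac{(y_i - \beta_0 - \beta_1 x_i)^2}{1 + \beta_1^2},$$

and setting $\partial SSE_p / \partial \beta_0 = 0$ gives

$$\beta_0 = \bar{y} - \beta_1 \bar{x},$$

as in the simple linear regression case.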
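
The right-triangle identity can be checked numerically against the standard point-to-line distance formula. The following is a minimal sketch, not part of the original derivation: the function names are illustrative, and it assumes the two legs are the vertical and horizontal errors of a point relative to a line with nonzero slope.

```python
import math


def perpendicular_distance(x, y, b0, b1):
    """Distance from (x, y) to the line y = b0 + b1*x (point-line formula)."""
    # Line in implicit form: b1*x - y + b0 = 0.
    return abs(b1 * x - y + b0) / math.hypot(b1, 1.0)


def distance_from_triangle(x, y, b0, b1):
    """Same distance via h^2 = (a*b)^2 / (a^2 + b^2); assumes b1 != 0."""
    a = y - (b0 + b1 * x)  # vertical error (one leg)
    if a == 0:
        return 0.0  # the point lies on the line
    b = a / b1  # horizontal error (other leg)
    return math.sqrt((a * b) ** 2 / (a ** 2 + b ** 2))
```

Both functions should agree for any point and any nonzero slope, since each reduces to $|e|/\sqrt{1 + \beta_1^2}$ with $e$ the vertical error.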

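Set $S_{xx} = \sum_{i=1}^{n} (x_i - \bar{x})^2$, $S_{yy} = \sum_{i=1}^{n} (y_i - \bar{y})^2$,
and $S_{xy} = \sum_{i=1}^{n} (x_i - \bar{x})(y_i - \bar{y})$. Then

$$SSE_p(\beta_1) = \frac{S_{yy} - 2\beta_1 S_{xy} + \beta_1^2 S_{xx}}{1 + \beta_1^2},$$

and the equation for the slope $\beta_1$ is given by

$$S_{xy}\beta_1^2 + (S_{xx} - S_{yy})\beta_1 - S_{xy} = 0,$$

with roots (provided $S_{xy} \neq 0$)

$$\beta_1 = \frac{(S_{yy} - S_{xx}) \pm \sqrt{(S_{yy} - S_{xx})^2 + 4 S_{xy}^2}}{2 S_{xy}}.$$

If $S_{xy} \neq 0$, then there are two roots of the quadratic equation for the
slope, one positive and one negative (their product is $-1$). If $S_{xy} > 0$,
then the minimum of $SSE_p(\beta_1)$ is at the positive root, and if
$S_{xy} < 0$, the minimum is at the negative root. This follows from the fact
that $SSE_p(\beta_1)$ has two critical points, one to the left of zero and one
to the right of zero, and that the derivative at zero is
$\frac{d}{d\beta_1} SSE_p(0) = -2 S_{xy}$, so $SSE_p$ is decreasing at zero when
$S_{xy} > 0$ and increasing at zero when $S_{xy} < 0$.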
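If $S_{xy} = 0$, then $SSE_p(\beta_1) = (S_{yy} + \beta_1^2 S_{xx})/(1 + \beta_1^2)$,
and the minimum is (1) $S_{yy}$ with $\beta_1 = 0$ for $S_{yy} < S_{xx}$; (2)
$S_{xx}$ with $\beta_1 = \infty$ (a vertical line) for $S_{xx} < S_{yy}$; and (3)
$S_{xx} = S_{yy}$ with any value of $\beta_1$ for $S_{xx} = S_{yy}$.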
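
The case analysis above can be sketched in code. This is an illustrative implementation under stated assumptions, not the author's: the function name `perpendicular_fit` is hypothetical, a vertical line is reported as slope `math.inf` with intercept `math.nan`, the tie case $S_{xx} = S_{yy}$ arbitrarily returns slope 0, and the test for $S_{xy} = 0$ is an exact comparison that real data may need to replace with a tolerance.

```python
import math


def perpendicular_fit(xs, ys):
    """Return (b1, b0) minimizing the sum of squared perpendicular errors."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    sxx = sum((x - x_bar) ** 2 for x in xs)
    syy = sum((y - y_bar) ** 2 for y in ys)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))

    if sxy != 0:
        # The square root exceeds |syy - sxx|, so the "+" numerator is
        # positive and this root always has the same sign as sxy.
        disc = math.sqrt((syy - sxx) ** 2 + 4 * sxy ** 2)
        b1 = ((syy - sxx) + disc) / (2 * sxy)
    elif syy < sxx:
        b1 = 0.0  # horizontal line through the mean
    elif sxx < syy:
        return math.inf, math.nan  # vertical line x = x_bar (assumed encoding)
    else:
        b1 = 0.0  # any slope gives the same SSE_p; 0 is an arbitrary choice
    return b1, y_bar - b1 * x_bar
```

Because the root with the "+" sign always shares the sign of $S_{xy}$, a single formula covers both the positive-slope and negative-slope cases.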


3 Example 

For the data $\{(0,0), (1,1), (1,0), (0,0)\}$ with $\bar{x} = 1/2$, $\bar{y} = 1/4$,
$S_{xx} = 1$, $S_{yy} = 3/4$, $S_{xy} = 1/2$, $\rho = \sqrt{3}/3$, the "best" fit
straight line has $\beta_1 = 0.78078$ and $\beta_0 = -0.14039$.
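
The example can be cross-checked by minimizing $SSE_p(\beta_1)$ directly. The sketch below is an independent check rather than the author's computation: it compares the closed-form positive root with a brute-force grid search, and the grid range and step size are arbitrary assumptions.

```python
import math

xs = [0, 1, 1, 0]
ys = [0, 1, 0, 0]
n = len(xs)

x_bar = sum(xs) / n
y_bar = sum(ys) / n
sxx = sum((x - x_bar) ** 2 for x in xs)
syy = sum((y - y_bar) ** 2 for y in ys)
sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))


def sse_p(b1):
    """Sum of squared perpendicular errors as a function of the slope."""
    return (syy - 2 * b1 * sxy + b1 ** 2 * sxx) / (1 + b1 ** 2)


# Closed-form root; sxy > 0 here, so the minimum is at the positive root.
b1_closed = ((syy - sxx) + math.sqrt((syy - sxx) ** 2 + 4 * sxy ** 2)) / (2 * sxy)
b0_closed = y_bar - b1_closed * x_bar

# Brute-force check: slopes from -5 to 5 in steps of 0.0001 (assumed grid).
grid = [k / 10000 for k in range(-50000, 50001)]
b1_grid = min(grid, key=sse_p)

print(round(b1_closed, 5), round(b0_closed, 5), b1_grid)
```

The grid minimizer should agree with the closed-form slope to within the grid spacing.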


4 Summary 

We have investigated fitting a linear equation to data using, as the criterion
of fit, the minimum sum of squares of the perpendicular errors.